Value-Based Planning for Teams of Agents in Stochastic Partially Observable Environments

Authors

  • Frans A. Oliehoek
  • Frans Adriaan Oliehoek
  • L. P. Kaelbling
Abstract

Similar resources

The MADP Toolbox: An Open Source Library for Planning and Learning in (Multi-)Agent Systems

This article describes the Multiagent Decision Process (MADP) Toolbox, a software library to support planning and learning for intelligent agents and multiagent systems in uncertain environments. Key features are that it supports partially observable environments and stochastic transition models; has unified support for single- and multiagent systems; provides a large number of models for decisio...

Case-Based Behavior Recognition to Facilitate Planning in Unmanned Air Vehicles

An unmanned air vehicle (UAV) can operate as a capable team member in mixed human-robot teams if the agent that controls it can intelligently plan. However, planning effectively in an air combat scenario requires understanding the behaviors of hostile agents in that scenario, which is challenging in partially observable environments such as the one we study. We present a Case-Based Behavior Rec...

Coordinating Teams in Uncertain Environments: A Hybrid BDI-POMDP Approach

Distributed partially observable Markov decision problems (POMDPs) have emerged as a popular decision-theoretic approach for planning for multiagent teams, where it is imperative for the agents to be able to reason about the rewards (and costs) for their actions in the presence of uncertainty. However, finding the optimal distributed POMDP policy is computationally intractable (NEXP-Complete). T...

Enabling Supportive Communications in Decentralized Multi-Agent Teams

Supportive communication is an effective collaboration behavior identified in human teams in which team members share information proactively to improve overall team performance. Prior work formulated this objective as the Single-Agent in a Team Decision Problem (SAT-DP) where agents decide whether or not to communicate an unexpected observation during execution time. We extend the SAT-DP defin...

A Framework for Optimal Sequential Planning in Multiagent Settings

Research in autonomous agent planning is gradually moving from single-agent environments to those populated by multiple agents. In single-agent sequential environments, partially observable Markov decision processes (POMDPs) provide a principled approach for planning under uncertainty. They improve on classical planning by not only modeling the inherent non-determinism of the probl...

Publication date: 2009